SePaS: Word sense disambiguation by sequential patterns in sentences
نویسندگان
چکیده
An open problem in natural language processing is word sense disambiguation (WSD). A word may have several meanings, but WSD is the task of selecting the correct sense of a polysemous word based on its context. Proposed solutions are based on supervised and unsupervised learning methods. The majority of researchers in the area focused on choosing proper size of ‘n’ in n-gram that is used for WSD problem. In this research, the concept has been taken to a new level by using variable ‘n’ and variable size window. The concept is based on the iterative patterns extracted from the text. We show that this type of sequential pattern is more effective than many other solutions for WSD. Using regular data mining algorithms on the extracted features, we significantly outperformed most monolingual WSD solutions. The state-of-the-art results were obtained using external knowledge like various translations of the same sentence. Our method improved the accuracy of the multilingual system more than 4 percent, although we were using monolingual features.
منابع مشابه
Bangla Word Sense Disambiguation System using Dictionary Based Approach
Bangla Word sense disambiguation (WSD) is the task of selecting the appropriate senses of a Bangla word (i.e. meaning) in a given Bangla sentence when the word has multiple meanings. This thesis work focuses on the issues in designing a Bangla Word Sense Disambiguation System to remove the lexical semantic ambiguity by using dictionary based approach. This system works with two major steps: par...
متن کاملXLING: Matching Query Sentences to a Parallel Corpus using Topic Models for WSD
This paper describes the XLING system participation in SemEval-2013 Crosslingual Word Sense Disambiguation task. The XLING system introduces a novel approach to skip the sense disambiguation step by matching query sentences to sentences in a parallel corpus using topic models; it returns the word alignments as the translation for the target polysemous words. Although, the topic-model base match...
متن کاملLearning a Robust Word Sense Disambiguation Model using Hypernyms in Definition Sentences
This paper proposes a method to improve the robustness of a word sense disambiguation (WSD) system for Japanese. Two WSD classifiers are trained from a word sense-tagged corpus: one is a classifier obtained by supervised learning, the other is a classifier using hypernyms extracted from definition sentences in a dictionary. The former will be suitable for the disambiguation of high frequency wo...
متن کاملSupervised Word Sense Disambiguation with Sentences Similarities from Context Word Embeddings
In this paper, we propose a method that employs sentences similarities from context word embeddings for supervised word sense disambiguation. In particular, if N example sentences exist in training data, an N-dimensional vector with N similarities between each pair of example sentences is added to a basic feature vector. This new feature vector is used to train a classifier and identification. ...
متن کاملSemantic Role Labeling Approach for Evaluation of Text Coherence
Detection of semantic roles associated with linguistic elements is important to the textual classification of communicative context into specific identities. In this paper, a new model for semantically identifying sentences is presented through contextual patterns. The proposed contextual pattern originated its structure from a labeling process of the semantic roles provided by constituents of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Natural Language Engineering
دوره 21 شماره
صفحات -
تاریخ انتشار 2015